WE CLAIM:

1. A method of predicting the occurrence of a selected weather event, the
method comprising the steps of: 

identifying a first set of parameters that relate to the selected 
weather event, wherein the parameters in the first set are statistically correlated; 
identifying a data set comprised of a plurality of observations for
the weather event over a selected geographical area, each observation
containing values for the parameters;

performing a Principal Component Analysis (PCA) to identify and 
select a reduced set of principal components corresponding to a second set of 
parameters, the reduced set of principal components being uncorrelated, the
second set of parameters being a subset of the first set of parameters, the 
parameters in the second set of parameters contributing at least a majority of 
the variance of the first set of parameters; 

rotating the reduced set of principal components to provide a third 
set of parameters, wherein each parameter corresponds to a physical data
measurement; and 

performing a logistic regression on the third set of parameters to 
determine the predictive equation for forecasting of the selected weather event. 

2. The method of Claim 1, wherein the step of rotating the second set of 
parameters is accomplished by a linear transformation. 
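
The rotation recited in Claims 1 and 2 can be illustrated with a minimal Python sketch. It assumes only two retained components, so the linear transformation is a single plane rotation. The function name `rotate_loadings`, the loading values, and the hand-picked angle are hypothetical; the claims do not name a criterion for choosing the rotation.

```python
from math import cos, sin, pi

def rotate_loadings(loadings, angle):
    """Apply a 2 x 2 rotation (an orthogonal linear transformation)
    to every row of an m x 2 loading matrix."""
    c, s = cos(angle), sin(angle)
    return [[c * a1 - s * a2, s * a1 + c * a2] for a1, a2 in loadings]

# Hypothetical loadings of three parameters on two retained components.
loadings = [[0.7, 0.7], [0.7, -0.7], [0.6, 0.6]]

# Angle chosen by hand for illustration only.
rotated = rotate_loadings(loadings, pi / 4)

for before, after in zip(loadings, rotated):
    print(before, "->", [round(v, 3) for v in after])
```

After this rotation each parameter loads on a single component, which is the sense in which a rotated component can be matched to one physical measurement. The rotation leaves the sum of squared loadings in each row unchanged.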

3. The method of Claim 1, wherein the step of identifying the data set of 
observations comprises the formation of a data matrix [x] having n rows and m 
columns, each row representing the i-th observation and each column
representing the j-th parameter, where i is an index from 1 to n and j is an index
from 1 to m.
4. The method of Claim 3, wherein the step of performing the Principal
Component Analysis (PCA) further comprises the step of 

calculating a scaled data matrix [z] according to the following
equation:

$$[z] = \left([x] - \frac{1}{n}[1][x]\right)[D]^{-1}$$

where $\frac{1}{n}[1][x]$ is the mean of [x], [1] is the unity matrix whose elements are all
equal to unity, and $[D]^{-1}$ is the diagonal matrix with diagonal elements being the
reciprocals of the sample standard deviations of the m parameters and the
remaining elements being 0,

whereby scaling the data allows parameters with different magnitudes of
variance to be equally considered.
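
The scaling step of Claim 4 can be sketched in plain Python as follows. The sketch centers each column of the data matrix on its mean and divides by the sample standard deviation, using n - 1 in the denominator. The function name `scale_columns` and the sample observations are hypothetical.

```python
from math import sqrt

def scale_columns(x):
    """Return z: each column of x centered on its mean and divided by
    its sample standard deviation (n - 1 in the denominator)."""
    n, m = len(x), len(x[0])
    means = [sum(row[j] for row in x) / n for j in range(m)]
    stds = [sqrt(sum((row[j] - means[j]) ** 2 for row in x) / (n - 1))
            for j in range(m)]
    return [[(row[j] - means[j]) / stds[j] for j in range(m)] for row in x]

# Hypothetical observations: columns are temperature (K) and humidity (%).
x = [[280.0, 40.0], [285.0, 52.0], [290.0, 75.0], [295.0, 81.0]]
z = scale_columns(x)

for row in z:
    print([round(v, 3) for v in row])
```

Every column of the result has zero mean and unit sample variance, so a parameter measured in kelvin and one measured in percent enter the analysis on the same footing.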

5. The method of Claim 4, wherein the step of performing the Principal 
Component Analysis (PCA) further comprises the step of 

calculating a correlation matrix [r] from the scaled data matrix [z]
according to the following equation:

$$[r] = \frac{1}{n-1}[z]^{T}[z]$$
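
As an illustration of Claim 5, the sketch below forms the correlation matrix as the product of the transposed scaled matrix with itself, divided by n - 1. It assumes the input has already been scaled to zero mean and unit sample variance per column. The function name `correlation_matrix` and the small input are hypothetical.

```python
def correlation_matrix(z):
    """Return the m x m matrix z^T z / (n - 1) for an n x m scaled matrix z."""
    n, m = len(z), len(z[0])
    return [[sum(z[i][j] * z[i][k] for i in range(n)) / (n - 1)
             for k in range(m)]
            for j in range(m)]

# Hypothetical scaled data: two perfectly anti-correlated parameters.
z = [[-1.0, 1.0], [0.0, 0.0], [1.0, -1.0]]
r = correlation_matrix(z)
print(r)
```

The diagonal entries equal 1 because each scaled column has unit sample variance, and the off-diagonal entries are the pairwise correlations between parameters.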

6. The method of Claim 5, wherein the step of performing the Principal 
Component Analysis (PCA) further comprises the step of 

calculating an eigenvector matrix [e] and an eigenvalue matrix 
[a], both of which are derived from the correlation matrix [r]. 






Patent Docket No. 
02-00087-02 

7. The method of Claim 6, wherein the step of performing the Principal 
Component Analysis (PCA) further comprises the step of 

calculating a PC loading matrix [a] according to the following

equation: 
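
The eigen-decomposition of Claim 6 and the loading calculation of Claim 7 can be sketched in plain Python as follows. Two assumptions are made. First, the eigenvectors and eigenvalues are obtained with cyclic Jacobi rotations, which is one of several methods suitable for a symmetric matrix. Second, the loading matrix is taken to be each eigenvector scaled by the square root of its eigenvalue, which is a conventional definition and not necessarily the one intended here. The names `jacobi_eigen` and `pc_loadings` are hypothetical.

```python
from math import atan, cos, sin, sqrt, pi

def jacobi_eigen(r, sweeps=50, tol=1e-22):
    """Eigen-decompose a symmetric matrix with cyclic Jacobi rotations.

    Returns (values, vectors): eigenvalues in descending order and the
    matching unit eigenvectors stored as the columns of vectors."""
    m = len(r)
    a = [row[:] for row in r]
    e = [[float(i == j) for j in range(m)] for i in range(m)]
    for _ in range(sweeps):
        off = sum(a[p][q] ** 2 for p in range(m) for q in range(p + 1, m))
        if off < tol:
            break
        for p in range(m):
            for q in range(p + 1, m):
                if a[p][q] == 0.0:
                    continue
                diff = a[q][q] - a[p][p]
                theta = pi / 4 if diff == 0.0 else 0.5 * atan(2.0 * a[p][q] / diff)
                c, s = cos(theta), sin(theta)
                for k in range(m):  # a <- a J (columns p and q)
                    akp, akq = a[k][p], a[k][q]
                    a[k][p], a[k][q] = c * akp - s * akq, s * akp + c * akq
                for k in range(m):  # a <- J^T a (rows p and q)
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k], a[q][k] = c * apk - s * aqk, s * apk + c * aqk
                for k in range(m):  # accumulate eigenvectors: e <- e J
                    ekp, ekq = e[k][p], e[k][q]
                    e[k][p], e[k][q] = c * ekp - s * ekq, s * ekp + c * ekq
    order = sorted(range(m), key=lambda i: a[i][i], reverse=True)
    values = [a[i][i] for i in order]
    vectors = [[e[k][i] for i in order] for k in range(m)]
    return values, vectors

def pc_loadings(values, vectors):
    """Assumed definition: scale each eigenvector column by sqrt(eigenvalue)."""
    return [[v * sqrt(max(lam, 0.0)) for v, lam in zip(row, values)]
            for row in vectors]

# Hypothetical 2 x 2 correlation matrix.
r = [[1.0, 0.6], [0.6, 1.0]]
values, vectors = jacobi_eigen(r)
loadings = pc_loadings(values, vectors)
print([round(v, 6) for v in values])
```

Under the assumed definition, multiplying the loading matrix by its own transpose reproduces the correlation matrix, which gives a convenient check on the decomposition.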

8. The method of Claim 7, wherein the step of performing the Principal 
Component Analysis (PCA) further comprises the step of 

ordering the parameters by descending values of the eigenvalues 
calculated for each parameter; and 

selecting the p parameters that account for at least a majority of 
the variance observed in the observations. 

9. The method of Claim 8, wherein selecting the p parameters is 
accomplished by plotting the parameters in descending order on a graph and 
selecting those parameters that are to the left of where the slope of the graph of 
the parameters approaches a selected distance from 0. 
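
The two selection rules in Claims 8 and 9 can be sketched as follows. The first function counts how many of the largest eigenvalues are needed before their cumulative share of the total exceeds one half. The second mimics reading a scree plot: it keeps the components to the left of the first segment whose slope magnitude is at or below a chosen tolerance. The function names, the tolerance value, and the eigenvalues are hypothetical, and the scree rule is only one reading of the claim language.

```python
def count_by_majority(eigenvalues, min_fraction=0.5):
    """Smallest p whose top-p eigenvalues exceed min_fraction of the total."""
    ordered = sorted(eigenvalues, reverse=True)
    total = sum(ordered)
    running = 0.0
    for p, value in enumerate(ordered, start=1):
        running += value
        if running / total > min_fraction:
            return p
    return len(ordered)

def count_by_scree(eigenvalues, slope_tol):
    """Number of components left of the first nearly flat scree segment."""
    ordered = sorted(eigenvalues, reverse=True)
    for k in range(len(ordered) - 1):
        if abs(ordered[k + 1] - ordered[k]) <= slope_tol:
            return max(1, k)  # always keep at least one component
    return len(ordered)

# Hypothetical eigenvalues in arbitrary order.
eigenvalues = [0.5, 4.0, 0.6, 2.5, 0.4]
p_majority = count_by_majority(eigenvalues)
p_scree = count_by_scree(eigenvalues, slope_tol=0.2)
print(p_majority, p_scree)
```

With these values both rules retain the two largest components, but in general the two criteria need not agree.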

10. The method of Claim 1, wherein the definition of the geographical area 
comprises a regular grid. 

11. The method of Claim 1, wherein the definition of the geographical area
comprises a set of discrete points. 

12. The method of Claim 1, wherein the predictive equation for forecasting of
the selected weather event provides a probability of a binary event. 
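
A predictive equation that yields the probability of a binary event, as in Claim 12, can be sketched with a small logistic regression fitted by gradient ascent on the log-likelihood. The fitting method, learning rate, iteration count, function names, and the single-predictor data set are all assumptions made for illustration.

```python
from math import exp

def sigmoid(t):
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + exp(-t))

def predict_probability(w, row):
    """Probability of the event for one row of predictor values."""
    return sigmoid(w[0] + sum(wj * xj for wj, xj in zip(w[1:], row)))

def fit_logistic(rows, labels, lr=0.1, epochs=5000):
    """Fit an intercept and one coefficient per predictor by batch
    gradient ascent on the log-likelihood."""
    n, m = len(rows), len(rows[0])
    w = [0.0] * (m + 1)
    for _ in range(epochs):
        grad = [0.0] * (m + 1)
        for row, y in zip(rows, labels):
            err = y - predict_probability(w, row)
            grad[0] += err
            for j, xj in enumerate(row, start=1):
                grad[j] += err * xj
        w = [wj + lr * g / n for wj, g in zip(w, grad)]
    return w

# Hypothetical predictor values and event outcomes (1 = event occurred).
rows = [[-2.0], [-1.0], [-0.5], [0.5], [1.0], [2.0]]
labels = [0, 0, 1, 0, 1, 1]
w = fit_logistic(rows, labels)
print(round(predict_probability(w, [2.0]), 3))
```

The fitted equation returns a value strictly between 0 and 1 for any predictor value, which can be read as the forecast probability that the event occurs.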

13. A system for forecasting the probability of occurrence of a selected 
weather event, the system comprising: 
selecting a geographical area, the geographical area being divided
into a regular grid; 

developing predictors for observed values of atmospheric 
phenomena within the geographical area; 

collecting a plurality of observations of the predictors for each 
portion of the grid over a period of time; 

forming a data matrix [x] having n rows and m columns, each row
representing the i-th observation and each column representing the j-th
parameter, where i is an index from 1 to n and j is an index from 1 to m;

normalizing the data within the data matrix to provide a scaled 
data matrix [z], wherein parameters having different magnitudes of variance 
may be equally weighted; 

calculating a correlation matrix [r] from the scaled data matrix
[z];

calculating an eigenvector matrix [e] containing eigenvectors 
calculated from the correlation matrix [r] and an eigenvalue matrix [a] 
containing eigenvalues calculated from the respective eigenvectors, wherein the 
eigenvector matrix [e] provides a set of orthogonal vectors relating to variance; 

calculating a PC loadings matrix [a] from the eigenvector matrix 
[e] and the eigenvalue matrix [a]; 

calculating the PC score matrix [f] from the scaled data matrix
[z] and the PC loadings matrix [a]; and,

performing a logistic regression using the PC score matrix [f] to 
provide a probability for the occurrence of the selected weather-related event. 
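
The score calculation in Claim 13 can be sketched as a matrix product. The sketch assumes the score matrix is the scaled data matrix multiplied by the loadings matrix; the claim states only that the scores are calculated from those two matrices, so other definitions are possible. The function name `pc_scores` and the input values are hypothetical.

```python
def pc_scores(z, a):
    """Assumed definition: f = z * a for an n x m scaled matrix z and an
    m x p loadings matrix a, giving one row of p scores per observation."""
    m, p = len(a), len(a[0])
    return [[sum(row[j] * a[j][k] for j in range(m)) for k in range(p)]
            for row in z]

# Hypothetical scaled observations (3 x 2) and loadings (2 x 1).
z = [[-1.0, 1.0], [0.0, 0.0], [1.0, -1.0]]
a = [[0.5], [-0.5]]
f = pc_scores(z, a)
print(f)
```

Each row of the result holds the scores for one observation, and these rows are what a logistic regression would take as predictor values.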
14. The system of Claim 13, further comprising:

forming a reduced set of eigenvectors, wherein the eigenvectors in 
the reduced set correspond to parameters that account for at least a majority of 
the total variance of the parameters in the data set; 
performing an orthogonal linear transformation on the reduced set
of eigenvectors to form a rotated set of eigenvectors; and,

calculating the PC loadings matrix [a] from an
eigenvector matrix [e] consisting of the rotated set of eigenvectors and an
eigenvalue matrix [a] consisting of the eigenvalues corresponding to the
rotated set of eigenvectors.



15. A computer program product for forecasting the occurrence of a weather
event, the computer program product embodied on one or more computer- 
readable media and comprising: 

computer-readable program code means for input of a data set 
comprised of observations of data parameters made during a known time period
at geographical sites; 

computer-readable program code means for performing a 
Principal Component Analysis (PCA) to identify and select a reduced set of 
principal components corresponding to a subset of the data parameters, the 
reduced set of principal components being uncorrelated, the parameters in the
subset of parameters contributing at least a majority of the variance of all the 
parameters; 

computer-readable program code means for performing a logistic 
regression to determine the predictive equations for forecasting the occurrence 
of the selected weather event.
16. The computer program product described in Claim 15, wherein the
computer-readable program code means for performing a Principal Component 
Analysis (PCA) comprises 

a means for forming a data matrix [x] having n rows and m
columns, each row representing the i-th observation and each column
representing the j-th parameter, where i is an index from 1 to n and j is an index
from 1 to m;

a means for normalizing the data within the data matrix to provide 
a scaled data matrix [z], wherein parameters having different magnitudes of 
variance may be equally weighted;

a means for calculating a correlation matrix [r] from the scaled 
data matrix [z] ; 

a means for calculating an eigenvector matrix [e] containing a first 
set of eigenvectors calculated from the correlation matrix [r] and a first
eigenvalue matrix [a] containing eigenvalues calculated from the respective
eigenvectors, wherein the eigenvector matrix [e] provides a set of orthogonal 
vectors relating to variance; 

a means for identifying an eigenvector matrix [e] consisting of a
second set of eigenvectors corresponding to parameters that account for at
least a majority of the total variance of the parameters in the data set;

a means for performing an orthogonal linear transformation on the 
second set of eigenvectors to form a rotated set of eigenvectors; and,

a means for calculating a PC loadings matrix [a] from the 
eigenvector matrix [e] consisting of the rotated set of eigenvectors and the 
eigenvalue matrix [a] consisting of the eigenvalues corresponding to the
rotated set of eigenvectors; 
a means for calculating a PC loadings matrix [a] from the
eigenvector matrix [e] and the eigenvalue matrix [a] corresponding to the 
rotated set of eigenvectors; and, 

a means for calculating the PC score matrix [f] from the scaled
data matrix [z] and the PC loadings matrix [a], the PC score matrix [f]
provided as input to the logistic regression.